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ABSTRACT 

I present a new procedure to forecast the Bayes factor of a future observation by 
computing the Predictive Posterior Odds Distribution (PPOD). This can assess the 
power of future experiments to answer model selection questions and the probability 
of the outcome, and can be helpful in the context of experiment design. 

As an illustration, I consider a central quantity for our understanding of the cos- 
mological concordance model, namely the scalar spectral index of primordial pertur- 
bations, n$- I show that the Planck satellite has over 90% probability of gathering 
strong evidence against ns = 1, thus conclusively disproving a scale-invariant spec- 
trum. This result is robust with respect to a wide range of choices for the prior on 
n s - 
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1 INTRODUCTION 

Many interesting questions in cosmology are not about pa- 
rameter estimation, but rather about model selection. For 
example, we might be interested in assessing whether a new 
parameter is needed in our model, or whether a theoretical 
prediction for the value of a parameter can be confirmed by 
data. 

These kind of questions often cannot be satisfactorily 
answered in the context of frequentist (sampling theory) 
statistics, but find their natural for mulation in th e frame- 
work o f Bayesian model selection (see ITrottal (|2007d ) ; iLiddlel 
120071 ) and references therein). Bayesian model selection 
aims at working out the support that the data can offer 
to a model, by balancing the quality of fit that a more 
complicated model usually delivers with a quantitative 
embodiment of Occam's razor, favouring simpler explana- 
tions whenever they are compatible with the observations 
at hand. This is usually expressed in terms of the Bayes 
factor between two competing models, which represents the 
amount by which our relative believe in the two model has 
changed after the arrival of the data. There is a growing 
body of work in cosmology and astrophysics applying 
various brands of mo del selection tools t o a broad range 
of qu es tions, see e.g. | Drell et all (I200QT) : lLoredo fc Lambl 
J2002I); iHobson fc McLachlanl i2003h ; ISlosar et all ll2005tl; 
ISaini et all (12004); lLazarides et al.1 (12004); iBeltran et all 



.20051); iKunzet alJ il2006l); iMarshall et all il2006l); 
iMagueiio fc Sorkinl (120071); iParkinson et al.l ^2000 ): iTrottal 
1 2007al lbl); lBevis et al.1 (|2007l ). 
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The purpose of this paper is to present a new method to 
forecast the probability distribution of the Bayes factor for a 
future observation, called PPOD (for "Predictive Posterior 
Odds Distribution" 

Posterior odds forecasting was first introduced in ITrottal 

(2005), which used a single model to describe the present 
data. This has i nspired furthe r deve l opme nts of a sim- 
ilar technique in IPahud et al.1 (|2006l . 120071 ). In particu- 
lar, IPahud et al.l (|2006t ) pointed out that the Bayes fac- 
tor forecasting ought to consider multiple models and av- 
erage over them. This approach is used in the present work. 
For a different a pproa ch to Bayes factor forecasting, see 
iMukheriee et all l|2006l ). which instead focuses on delineat- 
ing regions of parameter space where future observations 
have the ability of delivering high-odds model selection re- 
sults. 

The use of our PPOD technique is illustrated on a 
central parameter of the cosmological concordance model, 
namely the scalar spectral index for cosmological pertur- 
bations, ns, which can be related to the characte r istics of 
the inflationary potential (see e.g. iLeach fc Liddld l|2003t) ). 
One interesting question bears on whether the distribution 
of fluctuations is scale-invariant, i.e. whether a model with 
ns — 1 (the so-called Harrison-Zeldovich power spectrum) is 
supported by data. Current cosmological observatio ns sup- 
port t he view th at ns 7^ 1, wi t h odd s of about 17 : 1 (|Trottal 
l2007d ) (see also lPahud et all (|2006t ). who find odds of 8 : 1 



1 The method was called ExPO (f or "Expected Posterior Odds" ) 
in a previous version of this work [|Trottall2005h . I am grateful to 
Tom Loredo for suggesting the new, more appropriate name. 
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in favour of ns 7^ 1). In this paper, we derive a predictive 
distribution for ns for the Planck satellite - an European 
cosmic microwave background satellite due for launch next 
year - and present a forecast for the model selection outcome 
from Planck observations. 

This paper is organized as follows: in section[5]we briefly 
review the main concepts of Bayesian model comparison. 
We then introduce our PPOD technique in section [3] and we 
apply it to derive the probability distribution for the model 
selection outcome from Planck in section |4j also discussing 
the dependence on the choice of prior. Section [5] is devoted 
to presenting our conclusions. 



2 BAYESIAN MODEL COMPARISON 

In this section we briefly review Bayesian model comparison 

and introduce our notation. 

Bayesian inference (see e.g. Ijavnesl (|2003h : iMacKavl 

(2003))' is based on Bayes' theorem, which is a consequence 
of the product rule of probability theory: 

p(d\8,M)n(9\M) 



P (8\d,M) 



(1) 



p(d\M) 

On the left-hand side, the posterior probability for the pa- 
rameters 9 given the data d under a model M is proportional 
to the likelihood p(d\0,M) times the prior probability dis- 
tribution function (pdf), tt(9\M), which encodes our state of 
knowledge before seeing the data. In the context of model 
comparison it is more useful to think of -k(0\M) as an inte- 
gral part of the model specification, defin ing the prior avai l- 
able parameter space under the model M (|Kunz et al.| [2006). 
The normalization constant in the denominator of {1} is the 
marginal likelihood for the model M (sometimes also called 
the "evidence") given by 



p(d\M) 



p{d\0, M)vr(0|M)d0 



(2) 



where designates the parameter space under model M. In 
general, 9 denotes a multi-dimensional vector of parameters 
and d a collection of measurements. 

Consider two competing models Mo and Mi and ask 
what is the posterior probability of each model given the 
data d. By Bayes' theorem we have 



p(Mi\d) oc p(d\Mi)ir(Mi) (i = 0,l) 



(3) 



where p(d\Mi) is the marginal likelihood for Mi and 7r(Mj) is 
the prior probability of the ith model before we see the data. 
The ratio of the likelihoods for the two competing models is 
called the Bayes factor: 

p(d\M ) 



(4) 



p{d\M x y 

which is the same as the ratio of the posterior probabilities 
of the two models in the usual case when the prior is pre- 
sumed to be noncommittal about the alternatives and there- 
fore 7r(Mo) — n(Mi) — 1/2. The Bayes factor can be inter- 
preted as an automatic Occam's razor, which disfa vors com- 
plex m odels involving many parameters (see e.g. iMacKav 



2003) for details, as well as the discussion in iLiddle et al 
2007)). A Bayes factor B01 > 1 favors model Mo and in 



terms of betting odds it would prefer Mo over Mi with odds 
of B01 against 1. The reverse is true for Bqi < 1. 



It is usual to consider the logarithm of the Bayes fac- 
tor, for which the "Jeffreys' scale" for the strengt h of evi- 
dence offers an empirically c alibrated rule of thumb (jjeffrevsl 
1961; Ka ss fc Raftervll 19951 ). Different authors use different 
conventions to describe the strengt h of evidence - in this 
work we use the same convention of ITrottal l|2007d ). deem- 
ing values I ln_Boi| > 1; > 2.5; > 5.0 to constitute 'positive', 
'moderate' and 'strong' evidence, respectively. 

Evaluating the marginal likelihood integral of Eq. ((2)1 
is in general a computationally demanding task for multi- 
dimensional parameter spaces. Several techniques are avail- 
able on the market, each with its o wn strengths and 
weaknesses: thermo dynamic integration (|Slosar et al ]l2003l: 
iBeltran et alj20()5[ ) , nested sampling (introduced by Shilling 
1 20041) and imp l ement e d in the cosmo l ogica l context 
by iBassett et al.1 (|2004h : iMukheriee et all (|2006l )). or the 
Savage -Dickey density ratio (SDDR), introduced in ITrottal 
(2007c). Since the method present here makes use of the 
S DDR, we brief ly remind the reader about it, referring 
to ITrottal (|2007d ) for further details. 

If we wish to compare a two-parameters model Mi with 
a restricted submodel Mo with only one free parameter, ip, 
and with fixed to — and assuming further that the prior is 
separable (which is usually the case in cosmology), i.e. that 

7r(w,V|Mi) = 7rHAfi)7r(^|Mo), (5) 

then the Bayes factor B01 of Eq. @ can be written as 

p(ui\d, Mi 



B01 = 



7r(w|Mi 



(SDDR). 



(6) 



Thanks to the SDDR, the evaluation of the Bayes factor 
of two nested models only requires the properly normalized 
value of the marginal posterior at w = under the extended 
model Mi , which is a by-product of parameter inference. We 
note that the derivation of ([6} does not involve any assump- 
tion about the posterior distribution, and in particula r about 
its norm ality. As it has been shown in Appendix C of lTrottal 
(2007c). the SDDR works well if the parameter value under 
the simpler model, oj*, is not too far away from the mean of 
the posterior under the extended model. The reason for this 
is that it becomes increasingly cumbersome to reconstruct 
the posterior with enough accuracy in the tails of the distri- 
bution. More specifically, for distributions close to Gaussian, 
Eq. ((SJl is likely to be reliable if u)± is less than about 3 stan- 
dard deviations away from the mean of the posterior. 

We now turn to describing our forecast technique allow- 
ing to obtain a probability distribution for the Bayes factor 
from future observations. 



3 BAYES FACTOR FORECAST: PPOD 

In designing a new observation, it is interesting to assess its 
potential in terms of its power to address model compari- 
son questions. To this end, we introduce a new technique 
which combines a Fisher information matrix forecast with 
the SDDR formula to obtain a forecast for the Bayes factor 
of a future observation. The result is a PPOD (for "Pre- 
dictive Posterior Odds Distribution") for the future model 
comparison results. 
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3.1 The predictive distribution 

We are interested in predicting the distribution of future 
data, from which the result of a future model comparison 
can be obtained. The predicting distribution for future data 
D is 



p{D\d) = Y J P(.D\d,M i )p{M i \d) 



i=0 
1 



(7) 



J2p{Mi\d) p(D\e,Mi)p(e\d,Mi)de, 



where the sum runs over the 2 competing models we are 
considering. Generalization to a larger number of mod- 
els is straightforward. In the above, p(D\8, Mi) is the pre- 
dicted likelihood for future data, assuming 8 is the correct 
value for the cosmological parameters (under model Mi). 
A Gaussian approximation to the future likelihood can be 
obtained by performing a Fisher Matrix analysis (FMA) as- 
suming 8 as a fiducial model. This yields a forecast of the 
parameters covariance matrix C fo r future data D (for a de - 
tailed account, see e.g. lKnoxl(ll995l);lKosowskv et all (|l996h : 
lEfstathiou fc Bond! ljl999h : lRocha et all l|2004 )\ 

The corresponding predictive posterior odds distribu- 
tion (PPOD) for the future Bayes factor, Box, conditional 
on current data d is then 

p{B m \d) = J p(B i,D\d)dD = j P (B i\D,d)p{D\d)dD 



= J 5{D - Boi(D))p(D\d)dD 



(8) 

where S denotes the Dirac delta-function, and Boi(D) de- 
notes the functional relationship between future data and 
the Bayes factor, given in our case by the SDDR, Eq. ©. 
The presence of the delta-function comes from the univocal 
relationship between the future data and Boi (see Eq. (|13[) 
below). In other words, the Bayes factor is simply a derived 
parameter of the future likelihood. 

It is instructive to consider the Gaussian case, whose 
PPOD can be written down analytically. We restrict our- 
selves to the case of nested models, and we write for the pa- 
rameter space of the extended model 8 = (uj, tp), where uj de- 
notes the extra parameter. If the predicted likelihood covari- 
ance matrix C does not depend on ip (in other words, if the 
future errors do not depend on the location in the subspace 
of parameters common to both models), it is easy to see 
from Eq. (0 that one can marginalize over the parameters 
common to both models, tp. Thus we can assume without 
loss of generality a 1-dimensional Mi compared with model 
Mo with no free parameters. We take a Gaussian prior on 
the extra parameter, centered around and of width equal 
to unity (this can always been achieved by suitably rescaling 
and shifting the variables), that we denote by 



p(u\Mi) = N ,i{uj) 



(9) 



and describe the present-day likelihood as a Gaussian cen- 
tered on uj = jj, of width <r, where (fi, a) are understood to 

2 An earlier version of this work did not carry out the sum over 
models, but was restricted to the i = 1 term of Eq. J7J. I am 
grateful to Andrew Liddle for bringing t his to my attention. This 
is also spelled out in iPahud et ail I2OO6I) . 



be expressed in units of the prior width and are thus dimen- 
sionless: 



p(d\u,Mi) = N„, a (uj) 



(10) 



The predicted likelihood under future data D is also Gaus- 
sian distributed, with mean uj — v and (constant) standard 
deviation r: 



p{D\uj,M x ) = N u>t (oj). 



(11) 



Here, the forecasted error r = vCii is taken to be inde- 
pendent on uj, and is understood to be the marginal error 
on uj, after marginalizing over the common parameters tp. 
Using Eqs. (JSHHJ into we obtain after a straightforward 
calculation 



T(T \ 2 

p{M x ) 



n 2 2 1 22 
1 v a + fj, T 



V V 2 + a 2 + t 2 <t- 



: exp 



1 (y - n) 2 + crV + tV 

2 T 2 + a 2 + r 2 a 2 



(12) 

where we have dropped irrelevant constants. As a function of 
the future mean v, Eq. (|12[) gives the probability of obtaining 
a value uj = v from a future measurement, conditional on the 
present data d and on the current model selection outcome. 
The PPOD can be obtained from (JSj and (JT2J by using the 
relation between v 2 and lnBoi (obtained by applying the 
SDDR): 



t^(1+t 2 ) In 



1 



2ttt 2 



21nS ] 



(13) 



For v = 0, corresponding to the future observation mea- 
suring the predicted value of uj under Mo, Eq. (|13|) gives 
the maximum odds in favour of model Mo one can hope to 
gather from a future measurement with error r. 

In the general case, where the current likelihood is non- 
Gaussian and the future likelihood covariance matrix can 
depend on 8, it is possible to compute p(D\d) numerically 
from a series of MCMC samples. By using a s imilar manipu - 
lation as the one illustrated in Appendix B of lTrottal (|2007cf ) 
to obtain the SDDR formula, we can recast the i — term 
in the sum (JTJl as 

p(M \d) j p{D\^,M )pWd,M )d^ = 

Boi^y^ j p(D\^,Mo)p{d\'4),^,M l )p{ii,u3 i ,\Mx)d'4). 

(14) 

Since the constant factor p(d) 1 is common to both terms 
in the sum and hence factors out, knowledge of the un- 
normalized posterior under Mi and of the present-day Bayes 
factor Boi is sufficient to compute the predictive data dis- 
tribution and therefore the PPOD by employing Eq. (|13p . 
Given N independent samples from the un-normalized pos- 
terior under Mi, p{d\ip , uj , M\)p(ip , lj|Mi) , which can be ob- 
tained by standard MCMC techniques, one proceeds to per- 
form a FMA at every sample, thus obtaining a prediction 
for the future covariance matrix at that point in parameter 
space. Let us denote the MCMC samples by 8j — (ujj,i/jj), 
j — 1,...,N. The predictive data distribution (0 is ob- 
tained by averaging the future likelihood over the samples, 
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i.e. using Eq. (fl4|) 



1 

pp|d)ocp(Mi)-^p(D|u^,Mi)+ 



1 

p(M )Bbi-^53p(I>K,V*,Mi) 



(15) 



where if is the number of samples in the chain with lu — LU* 
(or within a suitably small neighbourhood from cj*) and we 
have dropped an overall normalization factor p(d)~ . The 
corresponding PPOD for Boi can then be obtained using 
Eqs. (fl3)l and Q. 

The predictive distribution of Eq. (|15|l does not make 
any assumptions regarding the normality of the current pos- 
terior, nor of the prior. However, it does assume that the 
future likelihood can be described by a Gaussian distribu- 
tion, as is implicit in the use of the FMA. This aspect is 
not so critical, since FMA errors have proved to give reli- 
able estimates, especia lly when using "normal parameters" 
ijKosowskv et alj|2002l ) . The second assumption is hidden in 
Eq. (113[1 , which relates the future Bayes factor Boi to the fu- 
ture mean, v. This relation only holds for a Gaussian prior 
and assuming that the posterior for future data is accurately 
described by a Gaussian, which is likely to break down in 
the tails of the distribution, \v — w*|/r 2> 1. Nevertheless, 
we can still conclude that models which have \v — w*|/t 2> 1 
strongly disfavor Mo under future data, even though we can- 
not attach a precise value to the expected odds. This is why 
we present PPOD results by giving only the integrated prob- 
ability within a few coarse regions, as in Table [1] We notice 
that one could improve on both of the above assumptions 
by using MCMC techniques to sample from the future like- 
lihood rather than using a Gaussian approximation. This 
however would add considerably to the computational bur- 
den of the forecast. 



3.2 Extension to experiment design 

Our approach can be extended to the context of Bayesian 
experiment design, whose goal is to optimize a future ob- 
servation in order to achieve the maximum science return 
(often defined in terms of inf ormation gain or through 

and 



of inf ormation 
a suitable figure of merit, s ee lLoredd (12 003) 



refer- 



ences th erein for an overv i ew. | Bassettl l|2005h ; iBassett et al.l 
l|2005h : I Parkinson et alj (12007 ) for a more cosmology- 
oriented application and iFordl (|2004T ) for an astrophysical 
application) . 

The core of the procedure is the quantification of the 
utility of an experiment as a function of the experimental 
design, possibly subject to experimental constraints (such 
as observing time, sensitivity, noise characteristics, etc). 
The observing strategy and experiment design are then op- 
timized to maximise the expected utility of the observa- 
tion. The PPOD is a good candidate for an utility func- 
tion aimed at model comparison, for it indicates the prob- 
ability of reaching a clear-cut model distinction thanks to 
the future observation. The dependence on experimental de- 
sign parameters is implicit in the FMA, and therefore one 
could imagine optimizing the choice of experimental param- 
eters to maximise the probability of obtaining large posterior 
odds from the future data, integrating over current posterior 



knowledge. This is especially interesting since it marginalizes 
over our current uncertainty in the value of the parameters, 
rather than assuming a fiducial model as it is usually done 
in Fisher matrix forecasts common in the literature. 

Since in the present paper we focus on model compar- 
ison rather than experiment design, in the following we fix 
the experime ntal paramete r s for the Planck satellite to the 
value used in iRocha et al] l|2004l) . We leave further explo- 
ration of the issue of design optimization and PPOD for 
future work. 



4 FORECASTS FOR THE PLANCK 
SATELLITE 

In this section we investigate the potential of the Planck 
satellite in terms of model comparison results. For other 
works using a similar technique, parti ally inspired by our 
approach, see lPahud et~"ai1 ifcood 120071 ') 



4.1 Parameter space and current cosmological 
data 

As current cosmological data, we us e the WMAP 3-year 
temperature and polarization data dHinshaw et al.l 120061 ; 
I Page et al.l 



20061) supplemented by small-scale C MB mea- 



surements (|Readhead et al.ll2004l ; iKuo et al.ll2004 ). We add 
the Hubble Space Telescope measurement of the Hub ble con- 
stant H = 72±8 km/s/Mpc dFreedman et al.ll200ll ) and the 
Sloan Digital Sky Survey (SDSS) data on th e matter power 
spect rum on linear scales (k < 0.1/i _1 Mpc) (ITegmark et al.l 
l2004 l. Furthermore, we shall al so consider super novae lumi- 
nosity distance measurements (|Riess et alj|2004l ). We make 
use of the publ i cly av ailable codes CAMB and CosmoMC 
iLewis fc Bridle! l|2002i ) to compute the CMB and matter 
power spectra and to construct Monte Carlo Markov Chains 
(MCMC) in parameter space. We sample uniformly over 
the physical baryon and cold dark matter (CDM) densi- 
ties, cot = flbh 2 and uj c = Q. c h 2 , expressed in units of 
1.88 x 10 -29 g/cm 3 ; the ratio of the angular diameter dis- 
tance to the sound horizon at decoupling, O*, the optical 
depth to reionization r r (assuming sudden reionization) and 
the logarithm of the adiabatic amplitude for the primordial 
fluctuations, Inl0 10 y4s. When combining the matter power 
spectrum with CMB data, we marginalize analytically over 
a bias b considered as an additional nuisance parameter. 
Throughout we assume three massless neutrino families and 
no massive neutrinos, we neglect the contribution of gravi- 
tational waves to the CMB power spectrum and we assume 
a flat Universe. 



4.2 PPOD forecast for the spectral index 

From the current posterior we can produce a PPOD forecast 
for the Planck satellit^fl following the procedure outlined in 
Section[3] As motivated in the introduction, we focus on the 
sc alar spectral in dex ns and we follow the same setup as 
m iTrottal (|2007d ). comparing an Harrison-Zeldovich (HZ) 
ns = 1 model against a generic inflationary model with a 



See the website: http://astro.estec.esa.nl/Planck 
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Gaussian prior of width Ang = 0.2, as motivated by slow- 
roll inflation. In lTrottal (|2007cj ) it was shown that a compila- 
tion of present-day CMB, large scale structure, supernovae 
and Hubble parameter measurements yields moderate odds 
(17 : 1) in favour of ns 7^ 1. 

The result in terms of the predictive data distribution 
is shown in Figure [TJ and the corresponding PPOD for the 
Bayes factor is given in Table [1] for our choice of the prior 
scale, Ans = 0.2 (see below for a discussion of the depen- 
dence of our results on the prior choice) . In Figure [1] we 
plot p(D\d) for Planck conditional on present-day informa- 
tion both as obtained numerically from the MCMC chains, 
via Eq. (|15|l , and by using the Gaussian approximation with 
constant future errors, Eq. 1)120 . with a = 0.015/ Ans = 
0.075, t = 0.004/An s = 0.02 and fi = -0.05/An s = -0.25 
(all these quantities are expressed in units of the prior width, 
Ans = 0.2). We observe that Eq. 1 )120 is an extremely good 
approximation to the full numerical result, obtained from 
2000 thinned samples of a MCMC chain. This follows from 
the facts that the current posterior is close to Gaussian, and 
that the future errors forecasted for Planck vary only very 
mildly over the range of parameter space singled out by the 
present posterior. Furthermore, the future errors are almost 
uncorrelated with the fiducial value of ns- 

We then obtain the PPOD numerically via Eq. (fT3")l 
and we integrate the distribution to get the probability of 
the model comparison result from future data (tabulated in 
Table [TJ. The main finding is that Planck has a very large 
probability (Pr(ln B01 < —5) = 0.928) to obtain a high-odds 
result strongly favouring a spectral tilt over an HZ spectrum. 
This is consequence of the fact that the most probable mod- 
els under current data are clustered around ns = 0.95 and 
that Planck sensitivity will decrease the error around those 
models by a factor ~ 4. The region of the predictive distri- 
bution corresponding to decisive odds in favour of ns 7^ 1 is 
shown in green in the inset of Figure [TJ and it extends to all 
values ns 0.984. By contrast, the probability that Planck 
will overturn the present model selection result fa vouring 
n s 7^ 1 (currently with odds of about 17 : 1, see iTrottal 
l|2007ch ') is only around 6%. We also find that the maximum 
odds by which Planck could favour ns = 1 are of 20 : 1, or 
In 601 = 3.00 (for our choice of prior width), which would 
still fall short of the mark of "strong" evidence. It is inter- 
esting to note from Table [TJ that either temperature infor- 
mation or E-polarization information alone will be enough 
to deliver a high odds result with large probability (around 
90% in either case). 

The a bove findings are in good agreement with the con- 
clusions in lPahud et al.l (2006), which were obtained using a 
more qualitative version of our procedure. The PPOD proce- 
dure presented here improves on several, pote ntially impor- 
tant a s pects with respect to the method used in lPahud et al.l 
l|2006l . l2007h : PPOD takes into account the full predictive 
distribution, and in particular the potentially important 
tails of the distribution above ns = 1; it fully accounts for 
the possibility that ns = 1 but that Planck will actually end 
up (wrongly) favouring the HZ model because of a measure- 
ment in the tail of the predictive distribution for Mo; finally, 
it takes into account the effect due to the variation of the 
future error on ns across the current posterior (even though 
this aspect has been shown to be negligible in the present 
case). 



Table 1. Probability of future model comparison results (PPOD) 
for the Planck satellite, conditional on present knowledge. There 
is about 93% probability that Planck will be able to strongly 
favour ns 7^ 1> i.e. that it will obtain lnBoi < —5. The probabil- 
ity that Planck will favour a scale-invariant spectrum (In B01 > 0) 
is only about 6%. These results are robust even when using only 
temperature (TT only column) or only E-polarization informa- 
tion (EE only column). 



Spectral index: ns = 1 versus 0.8 < ns < 1.2 (Gaussian) 





All 


EE only 


TT only 


Pr(lnS i < -5) 


0.928 
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0.926 


Pr(-5 < lnBoi < -2.5) 


0.005 


0.018 
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Pr(-2.5 < lnSoi < 0) 


0.006 
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Pr(lnBoi > 0) 


0.061 


0.056 


0.059 
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Figure 1. Predictive data distribution for the Planck satellite 
nominal mission, conditional on current (WMAP3+ext) knowl- 
edge. We are plotting the probability distribution (normalized 
to the peak) of the future measurement of the spectral tilt. The 
bump at ns = 1 corresponds to the probability associated with 
the HZ model. The black curve is obtained numerically from 
MCMC chains (Eq. 115> 1 while the red is for the Gaussian approx- 
imation, Eq. 1)12) 1. In the inset, the shaded areas delimit regions 
where the Bayes factor from Planck deliver strong evidence in 
favour of ns 7^ 1 (green, lnBoi < —5.0, this region extending 
to all smaller ns values), moderate evidence for ns 7^ 1 (cyan, 
—5.0 < lnBoi < —2.5), positive evidence for ns 7^ 1 (magenta, 
-2.5 < lnBoi < 0.0) or favour n s = 1 (yellow, lnS i > 0). The 
corresponding probability values are reported in Table [D 



4.3 Dependence on the choice of prior 

The prior assignment is an irreducible feature of Bayesian 
model selection, as it is clear from its presence in the de- 
nominator of Eq. ([6]). In fact, the prior width controls the 
strength of the Occam's razor effect on the extended model, 
and thus a larger prior favours the simpler model. 

We can assess the impact of a change of prior on our 
PPOD results by plotting them as a function of the chosen 
prior width. In Figure [2] we show how the probabilities for 
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of the concordance model. We have found that the Planck 
satellite has over 90% probability of obtaining a strong 
(lnBoi < —5) model selection result favouring ns / 1 (for 
a prior width Ans = 0.2), thus improving on current, mod- 
erate odds (of about 17 : 1 or ln£>oi = —2.86). The proba- 
bility that Planck will find evidence in favour of ns — 1 is 
by contrast only about 6%. These results are qualitatively 
unchanged for a wide range of prior values, encompassing 
most reasonable prior choices. 
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Figure 2. PPOD dependence on the prior width for ng. We plot 
the PPOD result computed from the Gaussian approximation of 
Eq. 112> as a function of the width of the prior on rig for the 
model with ng 7^ 1. In order to change the conclusion of this 
work, namely that Planck has a large probability of conclusively 
measuring ns 7^ 1, one would have to adopt a prior larger than 
about 3.5 (crossing of the green and yellow lines). In this work, 
the prior width has been set to 0.2 (dotted, vertical line). 

Planck to obtain different levels of evidence for or against 
n s 7^ 1 change with a change in the choice of the prior 
width Ang. It is apparent that our result holds true for a 
wide range of prior values: even if the prior is widened to 
Ans = 1, the probability of a strong (lnBoi < —5, green 
line) result in favour of n s 7^ 1 is still about 80%. The prior 
width has to be enlarged to Ans 3.5 for the simpler model 
to have more than 50% probability of being favoured (yellow 
line, depicting the probability of obtaining In 601 > 0). 



5 CONCLUSIONS 

We have presented a new statistical technique (PPOD) to 
produce forecasts for the probability distribution of the 
Bayes factor from future experiments. The use of PPOD 
can complement the Fisher matrix forecasts in that it al- 
lows to assess the capabilities of a future experiment to ob- 
tain a high-odds model selection result. Being conditional 
on present knowledge, our PPOD technique does not assume 
a fiducial model, but takes into account the current uncer- 
tainty in the values of the underlying model parameters. 

We emphasize that the PPOD forecast, being condi- 
tional on the present posterior, is reliable provided there 
will be no major systematic shift in the parameter deter- 
mination with respect to present-day data. In other words, 
the PPOD only takes into account the statistical proper- 
ties of our knowledge, a point hardly worth highlighting (if 
we knew the outcome of a future measurement, it would be 
pointless to carry it out). 

We have applied this method to a central parameter 
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